Ch06 - Partitioning

Partitioned databases were pioneered in the 1980s by products such as Teradata, and were later rediscovered by NoSQL databases.

Partitioning by Key Range

choice of hash function for keys:
- Cassandra and MongoDB use MD5; Voldemort uses the Fowler–Noll–Vo function.
- MurmurHash - non-cryptographic, lightweight
- built-in Java hash function is a bad choice, because the same key may get a different hash value in different processes
  - it's built for in-memory hash tables, and the behavior is a safety mechanism to prevent hash-collision attacks
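
A minimal sketch of the point above, in Python rather than Java: the partition is derived from an MD5 digest, so every process computes the same value for the same key. The function name `stable_partition` and the modulo step are assumptions for illustration only; real systems assign hash ranges to partitions rather than taking a plain modulus.

```python
import hashlib


def stable_partition(key: str, num_partitions: int) -> int:
    """Map a key to a partition using MD5, which is stable across processes."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    # Interpret the first 8 bytes of the digest as an unsigned integer.
    value = int.from_bytes(digest[:8], "big")
    return value % num_partitions


if __name__ == "__main__":
    # Python's built-in hash() for strings is salted per process by default,
    # so it would be a poor choice here for the same reason as the note above.
    for k in ["user:1", "user:2", "user:3"]:
        print(k, stable_partition(k, 8))
```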
Consistent Hashing
- great for caches
- does not work great for DBs
range queries are difficult - Cassandra and DynamoDB use a hashed partition key plus a sort key
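
A rough sketch of the hash + sort key idea: the partition key is hashed to pick a partition, and rows that share a partition key are kept ordered by sort key, so a range scan stays inside one partition. The in-memory layout and the names `put` and `range_query` are hypothetical, not how Cassandra or DynamoDB are implemented.

```python
import bisect
import hashlib
from collections import defaultdict

NUM_PARTITIONS = 4  # assumed value for the example

# One dict per partition: partition key -> list of (sort_key, value), kept sorted.
partitions = [defaultdict(list) for _ in range(NUM_PARTITIONS)]


def partition_of(partition_key: str) -> int:
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % NUM_PARTITIONS


def put(partition_key: str, sort_key: int, value: str) -> None:
    rows = partitions[partition_of(partition_key)][partition_key]
    bisect.insort(rows, (sort_key, value))  # keep rows ordered by sort key


def range_query(partition_key: str, lo: int, hi: int) -> list:
    """Return rows with lo <= sort_key < hi, touching a single partition."""
    rows = partitions[partition_of(partition_key)].get(partition_key, [])
    start = bisect.bisect_left(rows, (lo,))
    end = bisect.bisect_left(rows, (hi,))
    return rows[start:end]
```

A range query over the sort key works only when the partition key is fixed; a range over partition keys would still have to go to every partition.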
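hotspot relief: at the application level - spread writes for a hot key across multiple keys by appending a random number (key_random_num); reads then have to combine all of those keys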
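
A small sketch of the key_random_num trick, assuming an in-memory dict as a stand-in for the database. `SaltedStore` and its parameters are made-up names; the point is that writes pick one of several sub-keys at random, while reads must visit all of them.

```python
import random
from collections import defaultdict


class SaltedStore:
    """Toy key-value store that spreads writes for a key over several sub-keys."""

    def __init__(self, num_salts: int = 10, seed=None):
        self.num_salts = num_salts
        self.rng = random.Random(seed)
        self.data = defaultdict(list)  # stand-in for the partitioned database

    def write(self, key: str, value) -> str:
        # Append a random two-digit suffix so writes land on different keys.
        salted_key = f"{key}_{self.rng.randrange(self.num_salts):02d}"
        self.data[salted_key].append(value)
        return salted_key

    def read(self, key: str) -> list:
        # Reads pay the cost: every possible sub-key has to be fetched and merged.
        values = []
        for salt in range(self.num_salts):
            values.extend(self.data.get(f"{key}_{salt:02d}", []))
        return values
```

This only makes sense for the few keys known to be hot, since every read of a salted key becomes `num_salts` lookups.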

local index - a write touches only one partition; reads scatter to all partitions and gather the results
- Used by MongoDB, Riak, Cassandra, Elasticsearch, SolrCloud, and VoltDB
global index
- write reaches multiple partitions - distributed transaction or async update
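
The trade-off between the two index styles can be sketched as follows. Everything here is a toy assumption: documents are partitioned by `doc_id % 3`, index terms by first letter, and the index updates are synchronous.

```python
from collections import defaultdict

NUM_PARTITIONS = 3  # assumed value for the example

# Local index: each partition indexes only its own documents.
local_index = [defaultdict(set) for _ in range(NUM_PARTITIONS)]
# Global index: the index itself is partitioned by term.
global_index = [defaultdict(set) for _ in range(NUM_PARTITIONS)]


def doc_partition(doc_id: int) -> int:
    return doc_id % NUM_PARTITIONS


def term_partition(term: str) -> int:
    return ord(term[0]) % NUM_PARTITIONS  # toy scheme: partition by first letter


def write(doc_id: int, terms: list):
    """Index a document both ways; return the partitions each style touched."""
    p = doc_partition(doc_id)
    touched_global = set()
    for term in terms:
        local_index[p][term].add(doc_id)           # always the same partition
        gp = term_partition(term)
        global_index[gp][term].add(doc_id)         # may be a different partition
        touched_global.add(gp)
    return {p}, touched_global


def query_local(term: str) -> set:
    result = set()
    for index in local_index:                      # scatter to every partition
        result |= index.get(term, set())           # gather the partial results
    return result


def query_global(term: str) -> set:
    # Only the partition that owns the term is consulted.
    return set(global_index[term_partition(term)].get(term, set()))
```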

Fixed number of partitions
- more partitions than nodes
- when a new node joins, it steals partitions from the other nodes
- Used in Riak, Elasticsearch, Couchbase, and Voldemort
- hard to pick the right number of partitions
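
A sketch of the fixed-partition scheme above: partition boundaries never change, only the partition-to-node assignment does. The `add_node` helper and the "steal from the most loaded node" policy are assumptions for illustration, not the algorithm of any of the systems listed.

```python
from collections import Counter
from typing import Dict, List


def add_node(assignment: Dict[int, str], new_node: str) -> List[int]:
    """Move whole partitions to new_node until it holds its fair share.

    assignment maps partition id -> node name and is updated in place.
    Returns the ids of the partitions that were moved.
    """
    nodes = set(assignment.values()) | {new_node}
    target = len(assignment) // len(nodes)
    moved = []
    while len(moved) < target:
        counts = Counter(assignment.values())
        # Steal from the currently most loaded existing node (ties by name).
        donor = max((n for n in counts if n != new_node),
                    key=lambda n: (counts[n], n))
        pid = max(p for p, n in assignment.items() if n == donor)
        assignment[pid] = new_node
        moved.append(pid)
    return moved


if __name__ == "__main__":
    # 12 partitions spread round-robin over 3 nodes, then a 4th node joins.
    assignment = {p: f"node{p % 3}" for p in range(12)}
    print(add_node(assignment, "node3"))
    print(Counter(assignment.values()))
```

Only the moved partitions change owner; every other partition stays where it was, which is the advantage over recomputing placement with hash mod N.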
partition splitting and merging
number of partitions proportional to the number of nodes
- when a new node joins the cluster, it randomly chooses a fixed number of existing partitions to split, and then takes ownership of half of each partition
- used by Cassandra
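
The join procedure described above could look roughly like this, assuming partitions are modeled as half-open hash ranges `(lo, hi, owner)`. The `join` function and its parameters are hypothetical names for the sketch.

```python
import random
from typing import List, Tuple

Partition = Tuple[int, int, str]  # (lo, hi, owner) covering hash range [lo, hi)


def join(partitions: List[Partition], new_node: str,
         per_node: int, rng: random.Random) -> None:
    """Split randomly chosen partitions and give one half of each to new_node."""
    chosen = rng.sample(range(len(partitions)), min(per_node, len(partitions)))
    for i in chosen:
        lo, hi, owner = partitions[i]
        mid = (lo + hi) // 2
        partitions[i] = (lo, mid, owner)          # old owner keeps the lower half
        partitions.append((mid, hi, new_node))    # new node takes the upper half


if __name__ == "__main__":
    parts = [(0, 1000, "a"), (1000, 2000, "b"), (2000, 3000, "a"), (3000, 4000, "b")]
    join(parts, "c", per_node=2, rng=random.Random(0))
    for p in sorted(parts):
        print(p)
```

Because the splits are random, the resulting halves can be uneven in load; averaging over many partitions per node is what evens this out.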

auto vs manual rebalancing
- full automation can be dangerous in combination with automatic failure detection

request routing requires consensus on which node owns which partition
- use ZooKeeper - HBase, SolrCloud, Kafka, Espresso
- gossip - Cassandra and Riak
- no automatic rebalancing, separate routing tier - Couchbase